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ABSTRACT 


Modal*  of  the  probability  distribution  representing  the  residuals  between  the  observed  MOF 
and  field  strength  end  the  corresponding  predicted  values  from  HFBC84 ,  an  Ionospheric  predic¬ 
tion  program  developed  for  the  Broadcast  WARC,  are  presented.  A  data  base  of  13,054  hours  of 
|  oblique  sounder  MOF*  measured  on  70  paths  was  used  to  obtain  the  residuals  In  the  predicted 
i  MUFs.  A  modified  version  of  CCIR  Data  Base  C  was  used  to  obtain  the  residuals  in  predicted 
field  strength;  only  the  81  paths  for  short  path  propagation  were  retained.  The  residuals  for 
these  models  were  fit  to  the  Johnson  system  of  frequency  curves.  This  system  of  curves  con¬ 
sists  of  three  distributions:  (1)  data  that  is  unbound,  called  the  Sy  distribution;  (2)  data 
that  is  bound  on  one  end,  called  the  S.  distribution;  and  (3)  data  that  Is  bound  on  both  ends, 
called  S_  distribution.  Overall  for  the  MUF  residuals,  it  was  determined  that  the  data  could 
|  be  represented  by  a  S,  distribution.  Overall  for  the  field  strength  residuals,  a  S„  distribu- 
i  tion  fit  the  data  for  frequencies  below  or  equal  the  predicted  MUF,  and  a  SR  distribution  fit 
the  data  for  frequencies  greater  than  the  MUF. 


INTRODUCTION 

I 

]  The  effective  operation  of  long  distance  high  frequency  (HF)  communication  systems  has  in¬ 

creased  in  proportion  to  the  ebllity  to  predict  variations  In  the  ionosphere.  These  variations 
!  are  affected  In  a  complex  menner  by  soler  activity,  seasonal  and  diurnal  changes,  as  well  as 
latitude  and  longitude.  Such  a  predictive  capability  has  permitted  communicators  to  optimize 
frequencies,  antennas  and  other  circuit  parameters.  The  need  for  HF  model  uncertainty  assess¬ 
ment  has  become  Important  as  more  and  more  uses  ar*  found  for  the  prediction  models.  This 
paper  differs  from  the  more  usual  report  on  the  accuracy  of  prediction  models.  Here,  a  model 
!  of  the  probability  distributions  representing  the  residuals  (the  errors)  between  the  observed 
j  parameters  and  the  corresponding  predicted  values  is  presented.  The  residuals  for  the  predic¬ 
tion  model  ar*  fit  to  a  Johnson  system  of  frequency  curves  (Johnson,  1949)  using  an  algorithm 
due  to  Hill  et  al.  (Hill  at  al . ,  1976;  Hill  and  Wheeler,  1981;  Dodgson  and  Hill,  1983)  which 
uses  the  method  of  moments  to  obtain  the  required  parameters.  This  distribution  represents  all 
univariate  distribution  systems;  Its  simplicity  of  calculation  once  Its  parameters  have  been 
determined  makes  It  adaptable  to  minicomputer  type  of  applications;  and  the  transformation  of 
the  Johnson  variables  to  the  normal  system  allows  use  of  normal  probability  algorithms  In  Its 
application. 

Hare,  the  probability  distribution  representing  the  error  in  the  predicted  MUF  and  field 
strength  from  HFBC84 ,  an  ionospheric  prediction  program  for  the  HF  Broadcasting  UARC,  is 
presented.  A  data  base  of  13,054  hours  of  oblique  sounder  MOFs  measured  on  70  paths  was  used 
to  obtain  the  residuals  In  the  predicted  MUFs.  A  modified  version  of  CCIR  Dot*  Base  C  was  used 
to  obtain  the  residuals  in  predicted  field  strength;  only  the  81  paths  for  short  path  propaga¬ 
tion  were  retained.  Detailed  documentation  of  the  determination  of  the  residuala  for  the 
predicted  MUF  asnd  field  strength  for  HFBC84  is  given  by  Sprague  and  Sailors  (1987),  Roy  and 
Sailors  (1987),  and  Sprague  (1987). 

The  first  section  of  the  body  of  the  paper  discusses  the  determination  of  the  statistical 
model.  In  addition  to  the  method  of  moments,  this  section  presents  alternate  methods  for 
determining  the  parameters  when  the  moments  are  large.  The  second  section  presents  tables  of 
parameters  representing  the  probability  distribution  of  HFBC84  predictions.  However,  because 
of  the  limited  scope  of  the  paper,  not  all  the  variations  studied  can  be  presented.  The  third 
section  discusses  application  of  the  statistical  model.  The  final  section  discusses  Improve¬ 
ments  that  could  be  made  In  the  technique  used  to  get  the  Johnson  curve  parameters. 
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DETERMINATION  OF  A  STATISTICAL  MODEL 

The  first  step  In  dateraining  a  probability  distribution  to  represent  the  residuals  be¬ 
tween  observed  data  and  the  corresponding  predicted  data  is  to  hypothesize  a  distribution 
function  and  deteralne  its  parameters.  In  the  case  of  the  application  at  hand,  the  determina¬ 
tion  of  the  distribution  function  was  motivated  by  (1)  a  desire  for  ease  of  calculation  so  that 
the  results  could  be  used  in  ainicoaputar  type  applications,  and  (2)  a  need  to  automate  the 
data  analysis.  The  distribution  function  chosen  waa  one  known  as  the  Johnson  curves  (Johnson, 
1949) . 

Johnson  Curves 

The  Johnson  curves  are  an  empirical  faaily  of  curves  satisfying  the  following  chosen  con¬ 
ditions:  (1)  they  should  be  easy  to  evaluate  once  their  parameters  are  determined;  (2)  they 
are  a  aonotonic  function  of  y  where  y  -  (x-f)/A,  A  is  a  acale  factor,  £  a  location  factor  of 
che  distribution,  and  x  is  Che  variable  being  represented  by  the  distribution;  (3)  the  range  of 
values  of  f(y)  corresponding  to  the  actual  range  of  values  of  y  should  be  from  -  «  to  +  ■»;  and 
(4)  the  resulting  system  of  distributions  of  y  (and  so  of  x)  should  include  distributions  of 
most,  is  not  all,  of  che  kinds  encountered  in  collected  data.  Johnson  proposed  basing  empiri¬ 
cal  distributions  on  che  transformation  of  a  standard  normal  variate.  The  Johnson  system  of 
frequency  curves  consist  of: 

the  lognormal  systaa  (or  S^) :  z  -  y  +  4  in  [(x-£)/A],  ?<x,  (1) 

the  unbounded  system  (or  Sy):  z  -  y  +  4  sinh ((x-£)/A],  (2) 

the  bounded  systea  (or  Sg):  z  -  y  +  4  in  [ (x-£)/(£+A-x) ] ,  £<x<£+A  (3) 

where  z  is  the  standardized  normal  variate  in  each  case.  The  parameters  y  and  4  determine  the 
shape  of  the  distribution  of  x. 

To  decide  which  of  the  three  Johnson  families  should  be  used  for  a  given  set  of  data,  the 
usual  procedure  is  to  obtain  che  data  estimates  of  the  skewness  J0,  and  the  kurtosis  0..  These 
then  are  plotted  on  a  figure  such  as  figure  1  which  shows  the  region  in  the  (0. ,  0.)  plane  for 
the  three  Johnson  families.  Also  shown  are  other  common  sampling  distributions.  If  the  (0., 
02)  point  is  close  Co  the  S,  curve ,  the  S,  curve  is  chosen.  If  it  is  in  the  region  above  the1 
S,  curve,  the  S.  family  is  chosen;  and  it  it  is  below  the  curve,  the  SyCurve  is  used.  The  S, 
carve  can  be  extended  by  use  of  the  paraaetric  equations;  L 

0X  -  (u-1)  (eH-2)2  (4) 

fi2  “  «•*  +  2  «3  +  3  «2  -  3  (5) 

„  2 

where  u  denotes  exp  (S  ) .  The  impossible  region  in  the  figure  is  bounded  by  the  line  0.  -  0  - 

1-0.  These  three  systems  of  curves;  S,  ,  Sy,  and  S_;  together  cover  the  entire  “possible" 
region  of  the  /).,  0  -  plane  (the  plane  describing  the  entire  possible  3rd  and  4th  moment 
variation) . 

Fitting  Johnson  Curves  by  Momenta 

To  determine  the  parameters  for  the  Johnson  curves,  an  algorithm,  called  JNSN,  known  by 
the  Royal  Statistical  Society  as  algorithm  AS  99,  vas  used  (Hill  et  al . ,  1976;  Hill  and 
tfheeler,  1981;  Dodgson  and  Hill,  1983;  Criffiths  and  Hill,  1985).  This  algorithm  uses  the 
sample  moments  (l.a.,  the  mean,  standard  deviation,  skewness  ( J0 ^) ,  and  che  kurtosis  (0 J  to 
determine  Che  type  of  Johnson  curve  and  its  parameters.  For  the  sake  of  completeness,  the  al¬ 
gorithm  includes  the  normal  curve  Itself  and  the  special  case  of  the  SB  curve  on  the  0.  -  0  ,  + 

1  boundary  which  is  called  S_  (T  standing  for  two - ordinate ) .  Equations  (4)  and  (5)  are  solved 
in  JNSN  Co  determine  the  type  of  Johnson  distributions.  If  the  required  0.  from  these  equa¬ 
tions  is  less  than  the  input  0-,  S_  (or  S_)  is  appropriate;  if  greater,  Sy  (a  appropriate.  The 
parameters  y,  4,  f ,  and  A  are  found  suchthat  estimated  values  of  J0 ^  ana  0 2  using  the 
parameters  are  within  ±  0.01  of  Che  input  velues. 
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Figure  1.  legion  la  (61.  Ox)  plana  for  tha  Johnson  System  of  Curves. 
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Alternate  Methods  for  Datermlnlnc  <*•  Parameters 

When  the  aoaents  are  large,  tha  methods  of  moments  la  not  always  efficient.  Hence,  It  Is 
desirable  to  have  alternate  methods  to  use  to  obtain  a  second  set  of  parameters.  A  test  of  fit 
can  be  used  to  choose  between  the  sets  of  parameters  so  chosen.  In  addition  to  the  aethods  of 
momenta,  two  additional  methods  of  estimation  are  maximum  likelihood  and  the  use  of  quantiles 
or  percentiles .  Depending  on  the  Johnson  curve  being  evaluated,  the  different  aethods  aay  or 
swy  not  be  possible. 

For  the  S.  curve,  the  maximum  likelihood  method  can  be  used  to  obtain  tha  parameters  7  and 
t  If  la  known  (Hahn  and  Shapiro,  1967).  If  {  la  unknown,  one  Is  tempted  to  sat  (  to  the  mln- 
laua  value  In  the  data  sat.  However,  as  f  tends  to  the  minimum,  the  maximum  likelihood 
estimates  tend  to  Infinity.  Thera  can  be  considerable  variation  of  {  with  little  variation  of 
the  percentiles.  This  Is  particularly  true  for  large  negative  values  of  f.  In  the  algorithm 

written  for  this  aathod,  {  was  set  to  min  (x,,  x,,  ...,  Xj.)  -  0.1  min  (x.  ,  x- . x„) .  If 

the  value  of  (  is  unknown,  the  unknown  parameter  7,  S,  and  (  can  be  obtalnedn>y  setting  three 
percentiles  determined  from  the  data  to  the  three  corresponding  percentiles  for  the  normal 
standard  variate  t  (Hahn  and  Shapiro,  1967).  Three  equations  of  the  form 

A  A  A 

*a  -  7  ♦  <  in  (xa-f)  (6) 

A  A  A 

are  solved  for  the  parameters  7,  (,  and  x  is  the  o  100th  percentile  for  a  standard  normal 
variate  and  x  Is  the  corresponding  percsntlleafrom  the  data.  Hahn  and  Shapiro  provide  solu¬ 
tions  to  these  equations  for  any  two  symmetric  percentiles  (e.g.,  the  a  100th  and  the  (l-o) 
100th  percentile)  along  with  the  median  value.  If  (x,  -x-  ,  )  <  (x,,  .-x),  then  a  negative 
value  of  (  Is  obtained  and  the  percentile  method  fallff^hrs  Is  very  urilikeiy  to  be  the  case  If 
the  distribution  has  substantial  positive  skewness. 
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For  the  Sy  Clio,  tha  only  alternate  aethod  Is  Che  aethod  of  quantiles.  In  this  case  all 
four  peraaeCers,  (,  A,  7,  and  S,  have  Co  be  estimated.  This  requires  Chat  esclaaces  be  ob¬ 
tained  of  four  values,  x.,  X-,  *c,  x-  say,  such  that  certain  fixed  proportions  P  P  P  P 
respectively,  of  tha  distribution  fall  below  these  values.  Than  the  parameters  aayBbe  round 
froa  the  four  equation  for  z.  ,  the  normal  standard  deviate  for  the  S..  case.  This  set  of  equa¬ 
tions  Is  non- linear  In  nature  and  currently  there  Is  no  algorithm  that  has  been  used  to  solve 
them. 


For  the  S.  curves,  both  the  aaxlaua  likelihood  and  method  of  percentile  aethods  can  be  ap¬ 
plied  (Johnson,  1949) .  If  the  parameters  (  and  A  ere  known,  then  the  aaxlaua  likelihood  method 
can  be  used  to  obtain  the  paraaetars  7  and  f.  If  only  the  lower  end-point,  ( ,  Is  known,  the 
aethod  of  percentiles  can  be  used  to  estimate  A,  7,  and  S.  Tha  method  of  percentiles  uses  the 
SMdlua  xo  and  the  lower  and  upper  100th  percentile  points  x^  and  x^. 

Verification  of  Model  Fit 

Tha  final  step  In  the  determination  of  a  distribution  representing  the  error  date  was  the 
verification  that  the  fit  was  adequate.  In  addition  tha  test  of  fit  was  used  to  determine  If 
Che  parasMters  produced  by  alternate  aethods  were  preferable.  The  procedurea  followed  for  the 
test  of  fit  was  that  for  the  chi-square  test  of  fit  (Hahn  and  Shapiro,  1967;  Williams,  1950). 
The  first  step  la  the  estimation  of  the  unknown  paraaetars  of  the  assumed  distribution  (Johnson 
curves).  After  sorting  the  data,  the  data  Is  divided  Into  k  classes  or  cells,  and  the  prob¬ 
ability  of  a  random  value  froa  the  assumed  model  falling  with  each  class  Is  deterained.  The 
number  of  classes  k  itself  is  selected  by  the  following  formula,  depending  on  the  sample  size  N 
and  the  level  of  significance  C 

k  -  4  (2(M-1)2/C2]1/5.  (7) 


For  the  5%  level  of  significance  C  -  1.645.  The  class  limits  are  chosen  such  that  each  class 
contains  the  saae  number  of  iteas  under  the  null  hypothesis  (l.a.,  a  Johnson  distribution). 

For  this  distribution,  the  class  boundaries  were  found  by  inputting  1/k,  2/k . (k-l)/k  into 

an  algorithm  for  finding  the  normal  deviates  corresponding  to  the  lower  tall  (Beasley  and 
Springer,  1977).  Then  an  algorithm  was  used  to  find  the  Johnson  deviates  corresponding  to  the 
normal  deviates  (Hill  and  Wheeler,  1981;  Codgson  end  Hill,  1983;  Hill,  1976).  These  Johnson 
deviates  so  deterained  are  the  cell  boundaries.  The  lower  boundary  of  the  first  cell  and  the 
upper  boundary  of  tha  last  cell  are  tha  smallest  and  largest  values  that  tha  random  variable 
nay  take  on.  The  call  boundaries  ware  sec  up  in  such  e  way  Chet  the  probability  of  a  random 
value  falling  within  a  given  class  la  estimated  to  be  1/k  for  each  class.  The  expected  number 
of  observations  for  each  cell  under  Che  assumed  distribution  Is  N/k.  Tha  chi-square  test 
statistic  .'.s  given  by 


2 


X 


k  ^ 
*  X 

N  l-l 


(8) 


where  f.  Is  tha  observed  number  of  frequencies  in  the  1th  class.  As  Che  sample  size  approaches 
Infinity,  the  distribution  of  this  statistic  approaches  Che  chi-square  distribution  with  k-l-s 
degrees  of  freedom  where  s  la  the  number  of  parameters  estimated  from  the  sample. 


For  a  one-sided  (upper  tall)  test  for  a  significance  level  of  5t,  ,  (°)  1*  determined 
using  a  Cable  of  critical  values  for  tha  chl-squara  distribution  for  degrees  of  freedom  less 
than  equal  tan  and  the  Cornlsh-Fisher  approximation  for  larger  degrees  of  freedom  (Goldberg  and 
Levine,  1945;  Zar,  1978).  Ratios  of  x  /Xu  1  .  (a)  greater  Chan  1  signify  that  the  observed 
data  contradicts  the  assumed  nodal.  This  ratio1!*  also  used  to  determine  which  of  the  aethods 
of  determining  the  Johnson  parameters  Is  preferable;  the  nethod  producing  the  lowest  ratio  is 
used.  The  chl-squara  teat  holds  for  sample  sixes  as  low  as  200  (Williams,  1950). 


Tha  chl-equare  test  depends  on  the  data  being  ungrouped.  When  data  Is  grouped,  the  chl- 
squara  test  will  fall.  That  Is  because  equations  (7)  and  (8)  assume  that  there  are  N  ungrouped 
data  points.  What  happens  Is  that  x  calculated  from  equation  (|)  is  large  and  the  hypothesis 
Is  rejected  when  In  fact  It  might  be  accepted.  This  is  because  *  Is  computed  on  the  assump¬ 
tion  that  every  cell  has  an  expected  number  not  Coo  snail.  Usually  5  Is  given  as  the  required 
alnlaua.  When  tha  data  Is  grouped,  then  some  calls  have  large  numbers  In  them  and  adjoining 
cells  have  e  number  lass  than  the  alnlaua  or  even  zero.  One  solution  is  to  Include  cells  with 
numbers  less  Chan  the  alnlaua  with  adjoining  cells,  and  increase  the  number  of  cells.  However, 
Che  number  of  cells  aust  not  exceed  N/5.  In  the  application  here  the  number  of  calls  was 
doubled  when  grouping  occurred. 


Awiiioi't  N<m« 


Start  Iwr*^. 
hr  ell  mrt 
ehor  4m  lint 


On  lirtt  poM 
tyH  title  •» 

MM*  fcw  » 


Awtfor  —  _, 
Affiliation 
Cty,  Stata 


i  i  r 


JOHNSON  CURVE  PARAMETERS  FOR  HFBC84 

The  Johnson  curve  parameters  were  determined  for  cho  predicted  MUF  and  flald  strength  for 
HFBC84.  For  tho  predicted  MUF,  tho  p*r«MCira  were  found, as  a  function  of  latitude  of  tha  con¬ 
trol  points,  of  data  recorder  type,  of  geographies  region,  of  each  particular  path  in  the  data 
base  where  sufficient  data  was  available,  of  circuit  path  length,  of  aid-path  local  tiae,  of 
monthly  assn  sunspot  nuaber,  of  aonth,  of  path  orientation,  of  season,  and  of  saoothed  sunspot 
nuabar.  A  Halted  set  of  these  results  are  presented  in  tables  1  to  6.  For  the  predicted  field 
strength,  tha  peraaetars  were  found  as  a  function  of  frequency  relative  to  predicted  MUF,  of 
each  particular  path  in  the  data  base  where  sufficient  data  was  available,  of  aid-path 
latitude,  of  circuit  length,  of  aid-path  local  tiae,  of  relative  path  orientation,  of  season, 
of  saoothed  sunspot  nuaber,  and  of  aid-path  local  tiae.  In  the  case  of  Che  field  strength,  the 
parsaetars  were  data  rained  for  frequencies  >  MUF  and  frequencies  S  MUF.  A  Halted  set  of  these 
results  Is  presented  in  tables  7  to  14.  In  the  case  of  the  field  strength  data,  the  alternate 
aethods  produced  tha  preferable  set  of  peraaetars  quite  often,  particularly  the  aaxtnua 
likelihood  aethod.  Whereas,  for  the  MUF  residuals,  only  rarely  did  an  alternate  aethod  to  the 
method  of  aoaents  produce  the  preferable  set  of  paraaeters. 

In  tables  1  to  14,  the  conditions  for  which  the  analysis  was  nade-Ce-g  ,  circuit  length 
2000-3000  km) ,  the  saaple  size,  the  four  input  aoaents,  7,  S.  X.  £,  the  X  /Xu.}.  (005)  ratio, 

the  class  size  k.  the  Johnson  type  curve  t,  and  the  aethod  of  analysis  a  are  given.  Type  -  1 
Implies  an  S,  curve,  type  -  2  Implies  an  $..  curve;  type  -  3  Implies  an  S-  curve;  type  -  <1  la- 
plies  a  normal  curve;  and  type  -  3  implies  an  S_  curve.  Method  a  -  1  Implies  aethod  of 
aoaents;  aethod  m  -  2  Implies  percentile  points;  an<f  method  m  -  3  lap  l  las  maximum  likelihood 
In  the  table  the  error  codes  imply:  (1)  1  »  residual  statistics  were  not  calculated;  (2)  2  - 
residual  data  is  not  valid;  (3)  3  -  chi- squared  test  of  fit  unable  to  run,  usually  not  enough 
data  points;  and  (4)  4  -  chi-squared  critical  value  not  determined  usually  not  enough  degrees 
of  freedom.  Only  the  error  codes  3  and  4  having  to  do  with  the  ch- square  test  of  fit  actually 
occurred.  This  usually  was  due  to  grouping  in  the  residual  data,  particularly  the  field 
strength  data.  Because  the  data  In  Date  Base  C  is  given  only  as  integer  values,  this  causes 
grouping  precisely  at  values  of  field  strength  where  the  nodal  Is  aost  accurate. 

APPLICATION  OF  THE  MODEL 

After  having  deterainad  the  Johnson  distribution  paraaeters  for  HFBC84  by  the  aethods 
described  earlier,  it  Is  desirable  to  be  able  to  apply  the  results.  It  Is  envisioned  thee  it 
alght  be  applied  in  one  of  two  ways.  The  first  is  that  for  a  given  probability.  It  might  be 
desired  to  know  the  error  In  the  model.  The  second  Is  given  a  certain  error  in  the  model,  what 
Is  the  corresponding  probability?  There  are  several  useful  algorithms  to  aid  in  this  (Beasley 
and  Springer.  1977;  Dodgson  and  Hill.  1983;  Griffiths  and  Hill,  19*5 ;  Hill,  1973;  Hill,  1976, 
Hill  and  Wheeler,  1981). 

In  the  first  application,  the  given  probability  Is  converted  to  the  corresponding  normal 
standard  deviate  using  the  algorithm  function  PPND  (Beasley  and  Springer,  1977;  Griffith  end 
Hill,  1983).  Then  the  corresponding  Johnson  deviates  are  found  using  the  algorithm  AJV 
(Dodgson  and  Hill,  1983;  Griffiths  and  Hill,  1963;  Hill,  1976;  Hill  and  Wheeler.  1981)  The 
parameters  necessary  as  input  can  be  found  for  the  case  and  nodal  under  consideration  from  the 
tables  in  the  previous  section.  The  Johnson  deviates  are  the  error  for  the  aodel  being  ap¬ 
plied. 

In  the  second  application,  the  given  error  is  converted  to  normal  standard  deviates  using 
the  second  algorithm  due  to  Hill  (1976;  Dodgson  and  Hill,  1983;  Griffiths  and  Hill,  1983:  Hill 
and  Wheeler,  1981)  called  SNV.  Then  the  corresponding  probability  level  can  be  found  using  a 
normal  integral  algorithm  (Hill,  1973)  called  ALNORM.  This  particular  algorithm  has  the 
capability  to  calculate  either  the  upper  or  lower  tall  area  of  the  standardized  normal  curve 
corresponding  to  any  given  argument. 

A  saaple  of  the  application  of  tho  aodel  is  given  in  figures  2-3.  In  these  figures  the 
predicted  residual  is  given  for  seven  different  standard  normal  deviates  (snv)  and  their  cor¬ 
responding  probability  levels.  The  residuals  range  froa  values  that  might  occur  0.1%  of  the 
time  to  99.9%  of  the  tine.  Figure  2  is  for  the  MUF  aodel,  and  shows  the  residual  variation  as 
a  function  of  path  ranga.  Figure  3  is  for  the  field  strength  model,  and  shows  the  residual 
variation  as  a  function  of  ratio  of  the  frequency  to  the  predicted  MUF. 
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DISCUSSION 

A  model  of  the  probability  distribution  representing  the  residuals  (errors)  between  the 
observed  para*~cera  and  tha  corresponding  predicted  MUF  and  field  strength  values  from  HFBC84 
was  presented.  The  residuals  for  this  program  wars  f^t  to  the  Johnson  system  of  frequency 
curves .  t 

In  the  cases  of  both  the  S..  end  S_  curve,  the  alternate  method  for  determining  the  desired 
parameters  when  all  four  are  unknown  involves  solving  four  equations  representing  the  normal 
standard  davlate  for  four  percentile  points.  Thess  systems  of  equations  ere  non-linear,  and  an 
algorithm  needs  to  be  developed  to  solve  these  equations  for  the  required  parameters.  Further, 
the  appropriate  percentile  points  to  be  used  in  the  analysis  needs  to  be  determined. 

Finally,  tha  chi-square  test  of  fit  employed  la  affected  by  the  grouping  of  data  at  par¬ 
ticular  values.  At  precisely  the  values  at  which  a  propagation  model  Is  most  accur&ce  Is  where 
the  grouping  occurs.  This  Is  due  in  part  to  the  lack  of  enough  precision  or  enough  significant 
digits  In  the  observed  data  used  to  evaluate  the  prediction  model.  The  chi-square  test  of  fit 
is  used  because  It  can  be  used  when  data  from  a  sample  is  used  to  generate  e  probability  dis¬ 
tribution  function.  Other  well  known  tests  of  fit  methods  csn  not  be  employed  In  this  case. 
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Models  of  the  probability  distribution  representing  the  residuals  between 
the  observed  MOF  and  field  strength  and  the  corresponding  predicted  values 
from  HFBC  84,  an  ionospheric  prediction  program  developed  for  the  Broadcast 
WARC ,  is  presented.  A  data  base  of  13,054  hours  of  oblique  sounder  MOFs 
measured  on  70  paths  was  used  to  obtain  the  residuals  in  the  predicted  MUFs. 
A  modified  version  of  CCIR  Data  Base  C  was  used  to  obtain  the  residuals  in 
predicted  field  strength;  only  the  81  paths  for  short  path  propagation  were 
retained.  The  residuals  for  these  models  were  fit  to  the  Johnson  system  of 
frequency  curves  (N.  L.  Johnson,  Biometrika  3iL,  149-176,  1949).  This 
distribution  was  chosen  because  the  transformation  of  the  Johnson  variables  to 
the  normal  system  allows  use  of  normal  probability  algorithms  in  its 
application  and  because  it  can  be  used  to  represent  all  univariate 
distributions.  The  Johnson  system  of  curves  consists  of  three  distributions: 
(1)  data  that  is  unbounded,  called  the  distribution;  (2)  data  that  is  bound 

on  one  end,  called  the  distribution;  anc’  (3)  data  that  is  bound  on  both 

ends,  called  the  distribution. 

The  residuals  for  these  models  were  fit  using  an  algorithm  due  to  Hill  et 
al  which  used  the  methods  of  moments  (Hill  et  al,  Appl.  Statist.,  180-189, 

1976).  Since,  when  moments  are  large,  the  method  of  moments  is  not  always 
efficient,  the  alternate  methods  of  maximum  likelihood  and  quantities 
estimation  were  used  to  obtain  additional  sets  of  Johnson  parameters.  Then  a 
chi-square  test  of  fit  for  a  5%  level  of  significance  was  used  to  determine 
the  best  fit  to  the  data. 

Overall  for  the  MUF  residuals,  the  data  can  be  represented  by  a 

distribution.  Parameters  were  also  found  for:  (1)  data  recorder  type,  (2) 
path  length  and  orientation,  (3)  season,  (4)  month,  (5)  latitude,  (6)  sunspot 
number,  (7)  diurnal  trends,  (8)  geographic  region,  and  (9)  particular  paths. 

Overall  for  the  field  strength  residuals,  a  Sy  distribution  fit  the  data 

for  frequencies  below  or  equal  the  predicted  MUF,  and  a  distribution  fit 

the  data  for  frequencies  greater  than  the  MUF.  The  parameters  were  also  found 
as  a  function  of  (1)  frequency/KUF  ratio,  (2)  particular  paths  in  data  base, 
(3)  path  length,  (4)  season,  (5)  sunspot  number,  (6)  mid-path  local  time,  and 
(7)  latitude  at  path  mid-point. 
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